EPJ manuscript No. 

(will be inserted by the editor) 



t> : 
o , 
o ■ 
(N ; 

Oh 1 
<D ' 
00 



Evolving pseudofractal networks 



Zhongzhi Zhang 1,2 , Shuigeng Zhou 1 ' 2 , and Lichao Chen 



1.2 



1 Department of Computer Science and Engineering, Fudan University, Shanghai 200433, China 

2 Shanghai Key Lab of Intelligent Information Processing, Fudan University, Shanghai 200433, China 



Received: date / Revised version: date 



43 

a,: 

6 
o 

in 
O 

43 : 



> 

OS 

On 
O 

o 



X 



Abstract. We present a family of scale-free network model consisting of cliques, which is established by a 
simple recursive algorithm. We investigate the networks both analytically and numerically. The obtained 
analytical solutions show that the networks follow a power-law degree distribution, with degree exponent 
continuously tuned between 2 and 3. The exact expression of clustering coefficient is also provided for 
the networks. Furthermore, the investigation of the average path length reveals that the networks possess 
small-world feature. Interestingly, we find that a special case of our model can be mapped into the Yule 
process. 

PACS. 89.75.-k Complex systems - 89.75.Fb Structures and organization in complex systems - 05.10.-a 
Computational methods in statistical physics and nonlinear dynamics 



1 Introduction 

Over the last few years, it has been suggested that a lot 
of social, technological, biological, and information net- 
works share the following three striking statistical charac- 
teristics |l|2|3|4j : power-law degree distribution [5], high 
clustering coefficient j6], and small average path length 
(APL). Power-law degree distribution indicates that the 
majority of nodes (vertices) in such networks have only 
a few connections to other nodes, whereas some nodes 
are connected to many other nodes in the network. Large 
clustering coefficient implies that nodes having a common 
neighbor are far more likely to be linked to each other than 
are two nodes selected randomly. Short APL shows that 
the expected number of links (edges) needed to pass from 
one arbitrarily selected node to another one is low, that 
is, APL grows logarithmically with the number of nodes 
or slower. 

In order to mimic such complex real-life systems, a 
wide variety of models have been proposed 1 2 3 4J , among 
which the most well-known successful attempts are the 
Watts and Strogatz's (WS) small-world network model [6] 
and Barabasi and Albert's (BA) scale-free network model 
[5], which have attracted an exceptional amount of at- 
tention from a wide circle of researchers and started an 
avalanche of research on the models of systems within 
the physics community. After that, a considerable num- 
ber of other models and mechanisms, which may repre- 
sent processes more realistically taking place in real-life 
systems, have been developed. These mainly include non- 
linear preferential attachment [7J , initial attractiveness [5] , 
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edge rewiring [3] and removal [TU], aging and cost [TT] . 
competitive dynamics [12] , duplication [13] , weight [14115] , 
geographical constraint [16|17j . Apollonian packing [18|19|20|21|22|23] 
and so forth. Today, modeling complex systems with small- 
world and scale-free characteristics is still an important 
issue. 

Recently, Dorogovtsev, Goltsev, and Mendes have demon- 
strated that scale-free behavior and small-world effect can 
be excellently modeled by using pure mathematical ob- 
jects and methods to construct a deterministic graph [24J, 
called pseudofractal scale-free network (PSFN) which was 
extended by Cornelias et. al. in [25]. PSFN has drawn 
much attention from the scientific community, many dy- 
namical processes taking place in PSFN have been in- 
tensively studied, including synchronization [27], diffusion 
[28], and opinion formation [29]. The PSFN is a regular 
deterministic network in a certain sense without statistical 
mechanics for consideration. In [30] , Dorogovtsev, Mendes 
and Samukhin proposed a random growing network, which 
we call random pseudofractal scale-free network (RPSFN). 
The PSFN and RPSFN may provide valuable insight into 
some particular real-life networks. 

In this paper, we propose a general scenario for con- 
structing evolving pseudofractal networks (EPNs) governed 
by three parameters to, p, and q, which control the rele- 
vant network characteristics. The EPN unifies the PSFN 
and RPSFN to the same framework, i.e. the PSFN and 
RPSFN are special cases of EPN. In addition to PSFN and 
RPSFN, the EPN also includes many other models as its 
particular cases. More interestingly, one particular case of 
EPNs can be mapped into the Yule process. The growing 
EPNs are composed of cliques, and result in a power-law 
degree distribution with degree exponent changeable be- 
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t = 1 t = 2 

Fig. 1. Illustration of a deterministically growing network in 
the case of m = 2, p = 1, and q — 2, showing the first three 
steps of growing process. 



tween 2 and 3, a very large clustering coefficient, and a 
small-world feature. 



2 Network construction 

We construct the networks in a recursive manner and de- 
note the networks after t generations by Q(q,t), q > 2,t > 
0. Then the network construction process is as follows: For 
t = 0, Q(q 7 0) is a complete graph K q+ i (or (q+ l)-clique). 
For t > 1, Q(q,t) is obtained from Q(q,t — 1). For each 
of the existing subgraphs of Q(q,t — 1) that is isomorphic 
to a q~ clique, with probability p (0 < p < 1), to (to is 
a positive integer) new vertices are created, and each is 
connected to all the vertices of this subgraph. The grow- 
ing process is repeated until the network reaches a desired 
size. Figure [1] shows the network growing process for a 
particular case of m = 2, p = 1, and q = 2. 

There are at least five limiting cases of our model listed 
below, (i) When to = 1, p = 1, and q — 2, the networks are 
exactly the same as the pseudofractal scale-free network 
(PSFN) [24]. (ii) When to = 1, p -» (but p ^ 0), and q = 
2, our model is reduced to the random pseudofractal scale- 
free network (RPSFN) [3D], (iii) When to = 1, < p < 1, 
and q — 2, our networks coincide with the stochastically 
growing scale-free network described in [31] , (iv) When 
to = 1, p = 1, and q > 2, our networks reduce to the 
recursive graphs discussed in [25j . (v) When p = 1 and 
q > 2, our networks turn out to be the graphs introduced 
in [26]. Thus, varying parameters to, p, and we can 
study many crossovers between these limiting cases. 

Next we compute the numbers of nodes and links in 
Q{q 1 t). Let L v (t), L e (t) and K qt be the numbers of ver- 
tices, edges and g-cliques created at step t, respectively. 
Note that the addition of each new node leads to q new 
g-cliques and q new edges. So, we have L e (t) = K qt = 
qL v {t). Then, at step 1, we add expected L v {\) — mp(q + 
1) new nodes and L e (l) — mpq(q+l) new edges to Q(q, 0). 
After simple calculations, one can obtain that at tj(tj > 1) 
the numbers of newly born nodes and edges are L v (ti) = 
mp{q + 1)(1 + TOpg)* i_1 and L e (ti) = mpq(q + 1)(1 + 



mpq) ti , respectively. Thus the average number of total 
nodes N t and edges E t present at step t is 

N t = ±L v (U)= {q + mmPq+1)t+q - 1] (1) 
u=o q 

and 

E, = ±L, M = ( « + mmm t ir + { <- 2)] , (2) 

ti=0 

respectively. So for large t, The average degree k t = 
is approximately 2q. 

3 Topological properties 

Topology properties are of fundamental significance to un- 
derstand the complex dynamics of real-life systems. Here 
we focus on three important characteristics: degree dis- 
tribution, clustering coefficient, and average path length, 
which are determined by the tunable model parameters 
to, p, and q. 

3.1 Degree distribution 

Degree distribution is one of the most important statistical 
characteristics of a network. Firstly, we follow the method 
that was introduced in [20|23j for the calculation of degree 
distribution for the general case; then, we use the rate- 
equation approach [32j to get the degree distribution for 
some limiting cases. 

3.1.1 General case 

When a new node i is added to the networks at step i,;, 
it has degree q and forms q g-cliques. Let L q (i,t) be the 
number of g-cliques at step t that will possibly create new 
nodes connected to the node i at step t + 1. At step ti, 
L q {i,ti) — q. By construction, we can see that in the sub- 
sequent steps each new neighbor of i generates q — 1 new 
q-cliques with i as one node of them. Then at step ti + 1, 
there are mpq new nodes which forms mpq(q — 1) new q- 
cliques containing i. Let ki(t) be the degree of i at step t. 
We can easily find following relations for t > ti + 1: 

Aki(t) = h(t) - ki(t - 1) = m P L q (i, t - 1) (3) 

and 

L q {i, t) = L q {i, t-l) + (q- l)Aki{t). (4) 

From the above two equations, we can derive: L q (i,t + 
1) = L q (i, t)[l + mp(q — 1)]. Since L q (i, ti) = q, we have 
L q (i,t) = q[l+mp(q— l)] t ~ ti and Aki(t) = mpq[l+mp(q— 
l)] t_ ** _1 . Then the degree ki(t) of node i at time t is 

hit) = ki(U) + J2t h =u+i Aki{t h ) 

= q ^ [ 1 + m P(q- 1 )l'~ t '+'?- 2 ^ , (5) 
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Since the degree of each node has been obtained explicitly 
as in Eq. (|5|) , we can get the degree distribution via its cu- 
mulative distribution [3], i.e., P C um{k) = jf- J2k>>k -^Wi 
~ fc 1-7 , where N(k',t) denotes the number of nodes with 
degree fc'. The detailed analysis is given as follows. For a 
degree fc 



fc = q 



[l + mp(q- !)]*-* + q- 2 
q-1 



(0) 



there are L v (s) — mp(q + 1)(1 + mpg) s_1 nodes with this 
exact degree, all of which were born at step s. All nodes 
born at time s or earlier have this or a higher degree. So 
we have 



X;w(*',t)=EMa) = 



k<>k 



a=0 



(q+l)[(mpq + iy + q - 1] 

q 



As the total number of nodes at step t is given in Eq. fTJ) 
we have 



[l + mp{q- l)} t - s + q-2 



1-7 (q+l)l(mpq+iy+q-l] 

_ q 

(q+l)[(mpq+iy+q-l] 

q 



1-1/5 

Therefore, for large t we obtain 

[[1 + mp(q - I)]*" 8 ] 1-7 = (1 + mpq) 



and 



7 



1 



ln(l + mpq) 
ln[l + mp(q — 1)] 



(7) 



(8) 



Thus, the degree exponent 7 is a continuous function of 
m, p and q, and belongs to the interval [2,3], coincid- 
ing with the empirically found results. In some limiting 
cases, Eq. ([8]) recovers the results previously obtained in 
Refs. [24125126130(51] . Figure [2] shows, on a logarithmic 
scale, the scaling behavior of the cumulative degree dis- 
tribution Pcum(k) for different values of p in the case of 
m = 1 and q = 2. Simulation results agree very well with 
the analytical ones. 



3.1.2 Rate equation for some limiting cases 

When m = 1 and p — > (but p =/= 0), our model turns out 
to be the graph, which we call expanded random pseud- 
ofractal scale-free network (ERPSFN) that evolves as fol- 
lows (see [31] for interpretation): starting with a (q+1)- 
clique (t — 0), at each time step, we choose an existing 
g-clique, then we add a new node and join it to all the 
nodes of the selected g-clique. When q= 2, the particu- 
lar model is exactly the random pseudofractal scale-free 
network (RPSFN) [30]. 

In fact, the expanded random pseudofractal scale-free 
network can be easily mapped into the Yule process [33134] , 
which was inspired by observations of the statistics of bi- 
ological taxa. The Yule process can be prescribed math- 
ematically as follows: we measure the passage of time by 
the number of genera. At each time step one new species 
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Fig. 2. The cumulative degree distribution P cum (fc) at various 
p values for the case of m = 1 and q — 2. The circles (a), 
squares (b), stars (c), and triangles (d) denote the simulation 
results for networks with different evolutionary steps t = 1350, 
t = 25, t — 16, and t = 13, respectively. The four straight lines 
are the theoretical results of 7(m,p, q) as provided by equation 
©. All data are from the average of 50 independent runs. 



founds a new genus, thereby increasing the number of gen- 
era by 1, and q other species are added to various pre- 
existing genera which arc selected in proportion to the 
number of species they already have. Let the nodes and 
q-cliques in ERPSFN correspond to genera and species, 
respectively, then the mapping from ERPSFN to the Yule 
process is established. From this perspective, our model 
may find some applications in biological systems. Next, we 
show that the degree distribution of ERPSFN is power-law 
with the same degree exponent as the Yule distribution. 

Since the size of ERPSFN is incremented by one with 
each step, here we use the step value t to represent a 
node created at this step. Furthermore, after a new node 
is added to the network, the number of g-cliques increases 
by q. We can see easily that at step t, the network consists 
ofN = t + q+l nodes and N q = qN — q 2 + 1 ^-cliques. 

One can analyze the degree distribution mathemati- 
cally as follows. Given a node, when it is born, it has de- 
gree q, and the number of g-cliques containing this node 
is also q. After that, when its degree increases by one, the 
number of g-cliques with this node as one of its compo- 
nents increases by q — 1 , so the number of g-cliques for se- 
lection containing a node with degree fc is (q— l)fc — q 2 + 2q. 
We denote by Pk,N the fraction of nodes with degree fc 
when the network size is N. Thus the number of such 
nodes is NPk.N- Then the probability that the new node 
happens to be connected to a particular node i having de- 
gree fc, is proportional to (q — l)fcj — q 2 + 2q, and so when 
properly normalized is just [{q — l)fc; — q 2 + 2q]/(qN — 
q 2 + 1). Hence, between the appearance of the iVth and 
the (N + l)th node, the total expected number of nodes 
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with degree k that gain a new link during this interval is 



(q - l)k -q 2 + 2q 
qN - q 2 + 1 



x NP k 



k,N 



kP k 



k.N, 



(9) 



which holds for large N. Observe that the number of nodes 
with degree k will decrease at each time step by exactly 
this number. At the same time the number increases be- 
cause of nodes that previously had k — 1 degrees and now 
have an extra one. Thus we can write a rate equation 32J 
for the new number (N + l)Pk,N+i OI nodes with degree 
k as: 



NP k . N + ^- \{k - l)P k -i, N ~ kP k . N ] . 

q 



(N+l)P kiN+1 

a 

(10) 

The only exception to Eq. (|T0|) is for nodes having degree 
q, which instead obey the equation 



(N + l)P q>N+1 



NP, 



q,N 



1 



1 



<1 



qP q ,N, (ii) 



since by construction exactly one new such node appears 
at each time step. When N approaches infinity, we as- 
sume that the degree distribution tends to some fixed 
value P k = limjv_»oo Pn,u- Then from Eq. (|lip . we have 



(12) 



And Eq. (fT0|) becomes 



Pk 



1 



[(k - l)P fe _! - kP k ] 



(13) 



from which we can easily obtain the recursive equation 

k - 1 



■Pi 



fc-ij 



q— 1 

which can be iterated to get 

(jk-l)(k-2)...q 



(14) 



Pk = 



(k + l + ^ T )(k + 1 ±- 1 )...(q + 2+^ T ) 

(fc-l)(fc-2)...(g+l) 
{k + 1+ l ){k+ l )...( q + 2 + -L- )' 



(15) 



where Eq. (fT2"|) has been used. This can be simplified fur- 
ther by making use of a handy property of the P-function, 
r(a) = (a - l)r(a - 1) with P(a) defined by: 



r(a) 



„a—X-—x 



dx. 



(16) 



By this property and P(l) = 1, we get 

_ (g + i + ^)(g + ^)---(2 + ^)r(fc)r(2 + i i r ; 

(? + 1 + ^t)(9+^t)...(2 + ^t) 



9 (g -!)...! 



-B(fc,2- 



(17) 



where B(a,6) is the Legendre beta-function, which is de- 
fined as 

r(a)r(6) 

B(a ' 5) = 7>T5y' (18) 

Note that the beta-function has the interesting property 
that for large values of either of its arguments it itself 
follows a power law. For instance, for large a and fixed b, 



B(a,b) 



Then we can immediately see that for large 



k, P k also has a power-law tail with a degree exponent 



7 



q-1 



(19) 



For 



q = 2, 7 = 3, which has been obtained previously 
in [31] . Note that Eq. (fl7|) is similar to the Yule distribu- 
tion [35] called by Simon [36] , 



3.2 Clustering coefficient 

In a network if a given node is connected to k nodes, de- 
fined as the neighbors of the given node, then the ratio 
between the number of links among its neighbors and the 
maximum possible value of such links k(k — l)/2 is the 
clustering coefficient of the given node [6]. The cluster- 
ing coefficient of the whole network is the average of this 
coefficient over all nodes in the network. 

For our network, the analytical expression of clustering 
coefficient C(k) for a single node with degree k can be 
derived exactly. When a node is created it is connected to 
all the nodes of a g-clique, in which nodes are completely 
interconnected. So its degree and clustering coefficient are 
q and 1, respectively. In the following steps, if its degree 
increases one by a newly created node connecting to it, 
then there must be q — 1 existing neighbors of it attaching 
to the new node at the same time. Thus for a node of 
degree k, we have 



q(g-i) 



C(k) 



+ (q-l)(k-q) 2(«-l)(fc-§) 



fc(fc-i) 

2 



jfc(fc-l) 



(20) 



which depends on both k and q. For k 3> q, the C(k) is 

inversely proportional to degree k. The scaling C(fc) ~ fc" 1 

has been found for some network models [1811912012112212312412512613013 

and has also been observed in several real-life networks [37] . 

Using Eq. ([20]). we can obtain the clustering C t of the 
networks at step t: 



t 

E 

r=0 



2{g-l){D r -l)L v {r) 
D r {D r - 1) 



(21) 



where the sum runs over all the nodes and D r is the degree 
of the nodes created at step r, which is given by Eq. ([5]). 

In the infinite network order limit (N t — > oo), Eq. (|2"T| 
converges to a nonzero value C . Obviously, network clus- 
tering coefficient Ct is a function of parameters m, p and 
q. If we fixed any two of them, C t increases with the rest. 
Exactly analytical computations show: in the case m = 1 
and q — 2, when p increases from to 1, C grows from 
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Fig. 3. The dependence relation of network clustering coeffi- 
cient C on m, p, and q. Results are averaged over ten network 
realizations for each datum. 











■ q=2,p=0.6 






• q=3,p=0.6 


■ 


- 


■ q=2,p=0.98 






m n-3 9-0 9R 



























































1-5 L 4 

10 10 10 10 



N 

Fig. 4. Semilogarithmic graph of the APL vs the network size 
N in the special case of m = 1. Each datum point is obtained 
as an average of 50 independent network realizations. The lines 
are linear functions of IniV. 



0.739 [3H] to 0.8 [23; In the case p = 1 and q = 2, when 
m increases from 1 to infinite, C grows from 0.8 24J to 1; 
Likewise, in the case to = 1 and p = 1, C increases from 
0.8 to 1 when q increases from 2 to infinite, with special 
values C = 0.8571 and C = 0.8889 for q = 3 and q = 4, re- 
spectively. Therefore, the average clustering coefficient is 
very large, which shows the evolving networks are highly 
clustered. Figure [3] exhibits the dependence of the cluster- 
ing coefficient C on to, p and q, which agree well with our 
above conclusions. 

From Figs. H and [3] and Eqs. © and [[2"T]). one can 
see that both degree exponent 7 and clustering coefficient 
C depend on the parameter m, p, and q. The mechanism 
resulting in this relation should be paid further effort. The 
fact that there is a biased choice of the cliques at each 
evolving step may be a possible explanation, see Ref. [39 j . 



3.3 Average path length 

The most important property for a small-world network is 
a logarithmic average path length (APL) (with the num- 
ber of nodes). It has obvious implications for the dynam- 
ics of processes taking place on networks. Therefore, its 
study has attracted much attention. Here APL means the 
minimum number of edges connecting a pair of nodes, av- 
eraged over all pairs of nodes. In this subsection, first we 
give an upper bound of APL for the general case; then, 
we compute exactly the APL for a particular deterministic 
network. Both of the obtained values grow logarithmically 
with the network size. 



3.3.1 An upper bound of APL for general case 

We denote the network nodes by the time step of their 
generations, v = 1, 2, 3, • • • , N — 1, N. Using L(N) to rep- 
resent the APL of the our model with system size N, 



then we have following relation: L(N) — j^fzjj , where 
cr(N) — J2i<i<j<N is the total distance, in which dij 
is the shortest distance between node i and node j. By 
using the approach similar to that in |17|21|22|23j , we can 
evaluate the APL of the present model. 

Obviously, when a new node enters the networks, the 
smallest distances between existing node pairs will not 
change. Hence we have 



N 



a(N+l) = a(N)+J2d i!N+1 . 



(22) 



»=i 



Equation (|22p can be approximately represented as: 

a{N + 1) = a{N) + N + (N - q)L(N -q + l), (23) 
where 

t i y - qW j y -. + D- 3gg^a±a <3ggQ. (24) 



N-q + 1 



N 



Equations (f23|) and |24|) provide an upper bound for the 
variation of a(N) as 



da(N) =N+ 2a(N) 



dN 



N 



(25) 



which yields 

a(N) = N 2 {\uN + uj), (26) 

where u> is a constant. As cr(N) ~ N 2 In N, we have L(N) ~ 
In AT. 

Note that Eq. (|25p was deduced from an inequality, 
which implies that the increasing tendency of L(N) is at 
most as lnA^ with N. Thus, our model exhibits the pres- 
ence of small-world property. In Fig. 21 we show the de- 
pendence of the APL on system size A^ for different p and 
q in the case of to = 1. From Fig. [4j one can see that for 
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A 




Fig. 5. The network after t+1 generations, Qt+i, is obtained 
by joining three copies of generation t (i.e. Qt ,QtiQ^) a t 
the three nodes of highest degree, denoted by A, B, C. 

fixed q, APL decreases with increasing p; and for fixed p, 
APL is a decreasing function of q. When network size TV 
is small, APL is a linear function of In TV; while N be- 
comes large, APL increases slightly slower than In TV. So 
the simulation results are in agreement with the analyti- 
cal prediction. It should be noted that in our model, if we 
fix p and q, considering other values of m greater than 1, 
then the APL will increase more slowly than in the case 
m = 1 as in those cases the larger m is, the denser the 
network becomes. 



Eq. ([29]) is 

t-i 

D t =3*- 1 L>i+^3 t - r - 1 Z\ T . (30) 

r=l 

The paths that contribute to A T must all go through at 
least one of the three hubs (A, B, C) where the three 
different Q t branches are joined. Below we give the ana- 
lytical expression for A t named the crossing paths, which 
is given by 

A t = A 1 / + A 2 ' 3 + A]' 3 , (31) 

where A"' 13 denotes the sum of all shortest paths with 

endpoints in Q[ a ^ and Q^f\ It should be noted that Af'^ 
excludes the paths where either endpoint is the hub they 
have in common, and includes only one of the paths from 
the unshared hub in one Qt branch (e.g. Ql^) to all non- 
hub nodes in the other Q t branch (e.g. Q\^)- 
By symmetry, A\' 2 — A 2 ' 3 = A\' 3 , we have 

A t = ZA\< 2 , (32) 

where A t ' is given by the sum 

A 1,2 = E *j ( 33 ) 

i^A,C, j^A 

In order to find A t ' , we define 



3.3.2 Exact result of APL for a special case 

For p = 1, the networks are deterministic, which allows 
one to calculate the APL analytically. Here we only con- 
sider a particular case of to = 1, p = 1, and q = 2, which 
we denote after t generations by Qt- Then the average 
path length of Qt is defined to be: 



D, 



where 



N t (N t - l)/2 ' 



A = E d ^ 



(27) 



(28) 



The deterministic recursive construction of this par- 
ticular network has a self-similar structure that allows us 
to exactly calculate dt by following a similar approach in- 
troduced in [40]. As shown in Fig. El the network Qt+i 
may be obtained by joining at the hubs (the most con- 
nected nodes) three copies of Q t , which we label Q[ a \ 
a = 1, 2, 3 [28] , Then one can write the sum over all short- 
est paths D t +i as 



3A + A t 



(29) 



d) 



tot _ 



E dz ' A > 



1 1 ( ■ ■■ «.r — ^ ( 

zeQ^ 

dz,A<dz,c 



jmid 



jfar 



E 



zeQl 

dz,A=dz,c 



E 



IZ.A 



lZ,A 



IZ.A 



jyncar 

N mid 



N. 



far 



E 

ZeQ^ 
dz,A<dz,c 

E i 

dz,A=dz,c 

E i 



(34) 



in 



Z£Q\ 
iz,A>d z ,c 



(i) 



zeQ 

iz,A>dz,c 



where Z ^ A and Z ^ C. Then we can easily have d\ ot — 

2. By 



symmetry N t 



far 
near 



near + jy. 



mid 



far 



Nf, Thus, by construction, we obtain 



N t = 2 7V t ncar + 7V t mia + 2, 
N™f = N™ id + 2 7V t noar + 1. 



(35) 



Using these two relations and considering the initial val- 
ues, we obtain partial quantities in Eq. (|34| as 



where A t is the sum over all shortest paths whose end- 
points are not in the same Qt branch. The solution of 



iV t far = 

N mid 



^yncar 



i (3 + 3**) 



-3 + 3 t+1 ), 



(36) 
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Now we return to the quantity A t ' which can be further 
decomposed into the sum of four terms as 



A 1 ' 2 = 



E 



^,3 



E 

i^Q'SK 3dQ?\ i±A,C, &A,B 



(<Uc + dj.B + 1) 



li,A>d it c, 



>,A>a 



E 



(ck,A + dA,j) 



ijtA,C, jj£A,B, d iiA <di. c 

E 

ieQt 1 ', 3&Q[ 2 \ &A,C, j^A,B 



(d iiA + d,A,j) 



+ 



ii,A>di t c, dj : A<dj,£ 

E 



= 2iV t noar 4 loar 



i^A,C, j=B 

near \ 2 



(AT* 

(JV* 
(A?' 



^v f mid )(^r 



1) 



2)(c£ 



near 



dT 



d ) 



^mid^fa 



+ jv „ear^„car + d mid) 



d „oar 



(37) 

Having A)* 2 in terms of the quantities in Eq. (j34|) . the next 
step is to explicitly determine these quantities unresolved. 

Since A and C are linked by one edge, for any node 
i in the network, d^A and ^i,c can differ by at most 1. 
In addition, considering the self-similar structure of the 
graph, we can easily know that at time t+1, the quantities 
d^li, dflY and 4+1 evolve as 



imid 
"t+1 


= df d H 


- 2 df v + 


1, 




ni car 
"t+l 


= <c d - 


f 2 ar 




(38) 


.tar 
"t+l 




- 2 4 ar + 







From these recursive equations we can obtain 

■5), 
-2), 

-l)-3 t+1 



' df d = 3*~ 2 (t - 

d „car = 3 t-2 ^ 

e£ ar = — (2(t- 
t 54 V V 



(39) 



11 • 3 



t+i 



27 



Substituting the obtained expressions in Eqs. (J36J) and 
([39]) into Eqs. (|37| and (|32| . the crossing paths A is ob- 
tained to be 



A = -[(4* + 13) 9* -9] 



12 

Inserting Eq. (HP)) into Eq. (f3T)]) and using D\ 
have 

£> t = - Ut ■ 9* + 10 -3* + 11 ■ 9* +3) . 

8 



(40) 
21, we 

(41) 



Substituting Eqs. ([T]) and (HH) into (|2T|) . the exact expres- 
sion for the average path length is obtained to be 



U- 9* + 10 -3* + 11 -9* 
3 + 4-3 t+1 +9 t+1 



(42) 



In the infinite network size limit (t — > cx)), 

4 11 

dt -*+y~lniVt, 



(43) 



which means that the average path length shows a loga- 
rithmic scaling with the size of the network. 

It should be mentioned that the final expressions con- 
tained in Eqs. (|4"2"]l and (|4"3"|) were quoted earlier in Ref. [M] 
[Eqs. (6) and (7) of that work]. However Ref. [H] did not 
provide any of the details of the derivation, so the explicit 
calculation presented here is pedagogically useful. More- 
over, the analytical method may guide and shed light on 
related studies for other deterministic network models. 



4 Conclusions and discussions 

In summary, we have proposed and studied a class of 
evolving networks consisting of cliques. We have obtained 
the analytical and numerical results for degree distribu- 
tion, clustering coefficient, as well as the average path 
length, which are determined by the model parameters 
and in accordance with large amount of real observations. 
The networks are scale-free, with degree exponent ad- 
justed continuously between 2 and 3. The clustering co- 
efficient of single nodes has a power-law spectra, the net- 
work clustering coefficient is very large and independent 
of network size. The intervertex separation is small, which 
increases at most logarithmically as the network size. 

In real-life world, many networks consist of cliques. For 
example, in movie actor collaboration network [5] and sci- 
ence collaborating graph [41], actors acting in the same 
film or authors signing in the same paper form a clique, 
respectively. In corporate director network [42] , directors 
as members in the same board constitute a clique. Anal- 
ogously, in public transport networks [43], bus (tramway, 
or underground) stops shape a clique if they are consec- 
utive stops on a route, and in the network of concepts 
in written texts [44], words in each sentence in the text 
are added to the network as a clique. All these pose a 
very interesting and important question of how to build 
evolution models based on this particularity of network 
component — cliques. Interestingly, our networks, although 
different from real world, are formed by cliques, this par- 
ticularity of the composing units may provide a compre- 
hensive aspect to understand some real-life systems. In 
future, it would be more interesting to establish a model 
describing real systems consisting of cliques such as ac- 
tor collaboration network where cliques arise from mutual 
cooperation [45|46j . 

Future work should also include studying in detail dy- 
namical processes taking place on our networks, which 
may provide some original and interesting results to the 
field. For example, it should be possible to adapt the 
renormalization-group techniques used in Refs. 40 47 48 
for studying the Ising model to examine cooperative be- 
havior on the general case of our model. There could be 
a variety of interesting phase transition behaviors as the 
network structure is modified through the parameters to, 
p, and q. 
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